A Structural Approach to Indexing Triples
Abstract
As an essential part of the W3C’s semantic web stack and linked data initiative, RDF data management systems (also known as triplestores) have drawn a lot of research attention. The majority of these systems use value-based indexes (e.g., B-trees) for physical storage, and ignore many of the structural aspects present in RDF graphs. Structural indexes, on the other hand, have been successfully applied in XML and semi-structured data management to exploit structural graph information in query processing. In those settings, a structural index groups nodes in a graph based on some equivalence criterion, for example, indistinguishability with respect to some query workload (usually XPath). Motivated by this body of work, we have started the SAINT-DB project to study and develop a native RDF management system based on structural indexes. In this paper we present a principled framework for designing and using RDF structural indexes for practical fragments of SPARQL, based on recent formal structural characterizations of these fragments. We then explain how structural indexes can be incorporated in a typical query processing workflow, and discuss the design, implementation, and initial empirical evaluation of our approach.
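The grouping idea behind a structural index can be sketched in a few lines. This is an illustration only, not the SAINT-DB implementation: the function name `structural_index`, the parameter `k`, and the sample triples are hypothetical, and the equivalence criterion used here (two nodes are equivalent if they have the same outgoing predicates leading to equivalent nodes, refined for `k` rounds) is a simple bisimulation-style stand-in assumed for the example.

```python
from collections import defaultdict


def structural_index(triples, k=1):
    """Partition graph nodes into blocks by k rounds of signature refinement.

    Assumption: equivalence is a forward, bisimulation-style criterion chosen
    for illustration; the paper's actual criterion may differ.
    """
    nodes = {s for s, _, _ in triples} | {o for _, _, o in triples}
    out_edges = defaultdict(list)
    for s, p, o in triples:
        out_edges[s].append((p, o))

    # Round 0: all nodes start in a single block.
    block = {n: 0 for n in nodes}
    for _ in range(k):
        # Signature = current block plus the set of (predicate, target block) pairs.
        signature = {
            n: (block[n], frozenset((p, block[o]) for p, o in out_edges[n]))
            for n in nodes
        }
        # Assign a fresh block id to each distinct signature.
        ids = {}
        block = {n: ids.setdefault(signature[n], len(ids)) for n in sorted(nodes)}

    groups = defaultdict(set)
    for n, b in block.items():
        groups[b].add(n)
    return sorted(sorted(g) for g in groups.values())


# Hypothetical sample data: two "person knows person" chains with names.
triples = [
    ("alice", "knows", "bob"),
    ("carol", "knows", "dave"),
    ("bob", "name", "Bob"),
    ("dave", "name", "Dave"),
]
groups = structural_index(triples, k=1)
print(groups)  # [['Bob', 'Dave'], ['alice', 'carol'], ['bob', 'dave']]
```

In this sketch a query pattern such as "some node with a `knows` edge" would only need to inspect the one block containing `alice` and `carol`, rather than every triple, which is the kind of pruning a structural index is meant to enable.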
Similar resources
On Schur Multipliers of Pairs and Triples of Groups with Topological Approach
In this paper, using a relation between Schur multipliers of pairs and triples of groups, the fundamental group and homology groups of a homotopy pushout of Eilenberg-MacLane spaces, we present among other things some behaviors of Schur multipliers of pairs and triples with respect to free, amalgamated free, and direct products and also direct limits of groups with topological approach.
Analyses of RDF Triples in Sample Datasets
Linked Data principles supported especially by RDF triples appeared recently to enrich the Web of Documents by the Web of Data. However, each application that wants to process RDF triples has to deal with their distribution, dynamics and scaling. Thus, having understood structural and other features of such data, we may have better chances to propose these applications more efficiently. Especia...
SchemEX - Efficient construction of a data catalogue by stream-based indexing of linked data
We present SchemEX, an approach and tool for a stream-based indexing and schema extraction of Linked Open Data (LOD) at web-scale. The schema index provided by SchemEX can be used to locate distributed data sources in the LOD cloud. It serves typical LOD information needs such as finding sources that contain instances of one specific data type, of a given set of data types (so-called type clust...
A Graph-based Approach to Indexing Semantic Web Data
To the best of our knowledge, existing Semantic Web (SW) search systems fail to index RDF graph structures as graphs. They either do not index graph structures and retrieve them by run-time formal queries, or index all row triples from the back-end repositories. This increases the overhead of indexing for very large RDF documents. Moreover, the graph explorations from row triples can be complic...
A formal framework for image indexing with triples: Toward a concept-based image retrieval
This paper provides a formal specification for concept-based image retrieval using triples. To effectively manage a vast amount of images, we may need an image retrieval system capable of indexing and searching images based on the characteristics of their content. However, such a content-based image retrieval technique alone may not satisfy user queries if retrieved images turn out to be relevan...